Reliability Evaluation of Distributed Computer Systems Subject to Imperfect Coverage and Dependent Common-Cause Failures

Authors

  • Liudong Xing
  • Akhilesh Shrestha
Abstract

Imperfect coverage (IPC) occurs when a malicious component failure causes extensive damage due to inadequate fault detection, fault location, or fault recovery. Common-cause failures (CCF) are multiple dependent component failures within a system due to a shared root cause. Both IPC and CCF can exist in distributed computer systems (DCS), can contribute significantly to the overall system unreliability, and can complicate the reliability analysis. In this study, we propose an efficient approach to the reliability analysis of DCS with both IPC and CCF. The proposed methodology decouples the effects of IPC and CCF from the combinatorics of the solution. The resulting approach is compatible with the computationally efficient binary decision diagram (BDD) based method for the reliability analysis of DCS. We provide a concrete analysis of an example DCS to illustrate the application and advantages of our approach. Because it accounts for IPC and CCF, our approach can evaluate a wider class of DCS than existing approaches. Because of the nature of the BDD and the separation of IPC and CCF from the solution combinatorics, our approach is computationally efficient and easy to implement, so it can be readily applied to the accurate reliability analysis of large-scale DCS subject to IPC and CCF. DCS without IPC or CCF appear as special cases of our approach.
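The decoupling idea in the abstract can be illustrated with a toy sketch. This is not the authors' BDD-based algorithm: brute-force state enumeration stands in for the BDD, and all function names are hypothetical. It assumes independent components, a single common cause with occurrence probability `p_cc`, and that components hit by the common cause suffer covered failures. IPC is separated out by first conditioning on "no uncovered failure anywhere", and CCF is separated out by the total probability law over whether the common cause occurs.

```python
from itertools import product


def structure_unreliability(q, fails):
    """Pr(system fails) for independent components, by brute-force enumeration.

    q[i] is the failure probability of component i; fails(state) returns True
    when the system is failed, where state[i] is True if component i failed.
    (A BDD would replace this enumeration in a practical tool.)
    """
    total = 0.0
    for state in product([False, True], repeat=len(q)):
        p = 1.0
        for failed, qi in zip(state, q):
            p *= qi if failed else 1.0 - qi
        if fails(state):
            total += p
    return total


def ipc_unreliability(covered, uncovered, fails):
    """Unreliability with imperfect coverage, IPC separated from the combinatorics.

    covered[i] / uncovered[i] are the probabilities that component i suffers a
    covered / uncovered failure. Any uncovered failure is assumed to fail the
    whole system. Requires uncovered[i] < 1.
    """
    p_no_uncovered = 1.0
    for u in uncovered:
        p_no_uncovered *= 1.0 - u
    # Failure probabilities conditioned on "no uncovered failure occurred".
    q_cond = [c / (1.0 - u) for c, u in zip(covered, uncovered)]
    q_sys = structure_unreliability(q_cond, fails)
    return 1.0 - p_no_uncovered + p_no_uncovered * q_sys


def ccf_ipc_unreliability(covered, uncovered, fails, p_cc, affected):
    """Total probability over one common cause (hypothetical single-cause model)."""
    u_without = ipc_unreliability(covered, uncovered, fails)
    # Assumption: components in `affected` fail with certainty, as covered failures.
    cov_cc = [1.0 if i in affected else c for i, c in enumerate(covered)]
    unc_cc = [0.0 if i in affected else u for i, u in enumerate(uncovered)]
    u_with = ipc_unreliability(cov_cc, unc_cc, fails)
    return (1.0 - p_cc) * u_without + p_cc * u_with


def both_failed(state):
    """Toy structure function: a two-component parallel system."""
    return all(state)


if __name__ == "__main__":
    print(ipc_unreliability([0.1, 0.1], [0.05, 0.05], both_failed))
    print(ccf_ipc_unreliability([0.1, 0.1], [0.05, 0.05], both_failed, 0.02, {0, 1}))
```

Setting every uncovered probability and `p_cc` to zero reduces both functions to the plain structure-function result, which mirrors the abstract's remark that systems without IPC or CCF are special cases.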


Similar resources

Fuzzy Reliability Evaluation of a Repairable System with Imperfect Coverage, Reboot and Common-cause Shock Failure

In the present investigation, we deal with the reliability characteristics of a repairable system consisting of two independent operating units, by incorporating the coverage factor. The probability of the successful detection, location and recovery from a failure is known as the coverage probability. The reboot delay and common cause shock failure are also considered. The times to failure of t...


Reliability Evaluation of Multi-state Systems Subject to Imperfect Coverage using OBDD

This paper presents an efficient approach based on OBDD for the reliability analysis of a multi-state system subject to imperfect fault-coverage with combinatorial performance requirements. Since there exist dependencies between combinatorial performance requirements, we apply the Multi-state Dependency Operation (MDO) of OBDD to deal with these dependencies in a multi-state system. In addition...


MTBF evaluation for 2-out-of-3 redundant repairable systems with common cause and cascade failures considering fuzzy rates for failures and repair: a case study of a centrifugal water pumping system

In many cases, redundant systems are beset by both independent and dependent failures. Ignoring dependent variables in MTBF evaluation of redundant systems hastens the occurrence of failure, causing it to take place before the expected time, hence decreasing safety and creating irreversible damages. Common cause failure (CCF) and cascading failure are two varieties of dependent failures, both l...


Reliability Analysis of K-out-of-n: G Redundant System in the Presence of Lethal & Non-lethal Common Cause Shock Failures and with Imperfect Fault Coverage

This paper deals with the reliability analysis of a k-out-of-n: G redundant system in the presence of lethal and non-lethal common cause shock (CCS) failures along with imperfect fault coverage. S. Akhtar (1994) discussed the reliability analysis of a k-out-of-n: G redundant system with perfect and imperfect fault coverage and derived the reliability measures Rs(t), As(t), MTTF, and MTBF. K.Mall...


Calculation and Analysis of Reliability with Consideration of Common Cause Failures (CCF) (Case Study: The Input of the Dynamic Positioning System of a Submarine)

Reliability and safety are among the most important qualitative characteristics of any system. They are of particular importance in systems that operate under stresses such as high temperature, high speed, or high pressure. An important point, which is rarely taken into account when calculating the reliability and safety of syst...



Publication date: 2006